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INEQUALITIES  ON  DISTRIBUTIONS: 
BIVARIATE  AND  MULTIVARIATE 


ground.  Subjects  treated  include  inequalities  arising  through 


positive  and  negative  dependence;  Boole,  Bonferroni  and  Fr6chet 


inequalities;  convex  symmetric  set  inequalities;  stochastic 


ordering;  stochastic  majorization  and  inequalities  obtained  by 


majorization ;  Chebyshev  and  Kolmogorov-type  inequalities;  multi¬ 


variate  moment  inequalities;  and  applications  to  simultaneous 


inference,  unbiased  testing  and  reliability  theory. 
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1.  Introduction 


Inequalities  have  played  a  central  and  lasting  role  in  prob¬ 
ability  theory  as  well  as  in  mathematics  in  general.  In  mathema- 
ics,  the  subject  of  inequalities  has  centuries  old  roots  and  many 
prominent  researchers  have  contributed  to  its  growth.  The  classic 
book  by  Hardy,  Littlewood  and  Polya  (1952),  first  published  in 
1934,  provides  a  remarkable  compendium  of  mathematical  inequal¬ 
ities.  While  the  development  of  probability  inequalities  is 
intertwined  with  the  mathematical  development,  there  have  been 
separate  major  influences  as  a  result  of  the  special  needs  of 
probability  theory.  For  instance  one  inequality  common  to  both 
mathematics  and  probability  is  the  famous  Cauchy-Schwarz  in¬ 
equality  which  in  its  probabilistic  form  states  that 
Cov(X,Y)  (Var  X)^(Var  Y)^.  On  the  other  hand,  Chebyshev's  im¬ 
portant  inequality  germinated  in  the  context  of  probability  theory, 
being  developed  in  order  to  approximate  probabilities  of  complex 
events . 

The  origins  of  probability  inequalities  for  multivariate 
distributions  are  not  new  (for  instance,  Boole's  inequality  and 
also  the  Cauchy-Schwarz  inequality).  However,  the  dramatic  growth 
of  this  subject  area  has  taken  place  in  the  last  half  of  this 
century.  This  growth  parallels  the  major  growth  of  multivariate 
analysis  itself  during  this  period.  Recently  there  have  been 
some  attempts  to  impose  structure  on  the  area  of  multivariate 
probability  inequalities.  Two  notable  efforts  are  the  mono¬ 
graphs  of  Marshall  and  Olkin  (1979)  and  Tong  (1980).  Also  see 
the  review  paper  of  Eaton  (1982).  The  multivariate  inequalities 


/ 


2 


presented  here  are  divided  roughly  into  three  groupings,  namely, 
inequalities  among  random  variables,  stochastic  comparisons 
between  random  vectors,  and  moment  inequalities. 

While  multivariate  probability  inequalities  are  important 
in  their  own  right,  equally  important  is  the  application  of  these 
inequalities  to  problems  in  statistics.  The  context  in  which  they 
are  most  useful  is  dealing  with  probabilities  for  random  vectors 
having  complex  dependencies.  In  many  such  situations,  the  eval¬ 
uation  of  the  probabilities  is  technically  virtually  impossible, 
so  that  bounds  for  these  probabilities  become  highly  useful. 

Three  specific  areas  of  extensive  application,  which  are  dis¬ 
cussed  in  this  paper,  are  reliability  theory,  simultaneous  in¬ 
ference  and  unbiased  testing. 

2.  Positive  and  Negative  Dependence 

Various  inequalities  arise  when  the  entries  of  a  random  vec¬ 
tor  X = (X, , . . . ,X  )  are  positively  dependent.  (See  Barlow  and 
Proschan  (1975)  for  a  discussion.)  For  example,  there  are  many 
circumstances  where  for  all  Xj,...,x  , 

P 

Pr[X,  >  x, . X  >x  3>  11  PrCX,  >x.  3.  (1.1) 

x  j.  P  P  i=l  1  1 

This  is  a  type  of  positive  dependence  called  positive  upper 
orthant  dependence  (PUOD).  A  variant,  positive  lower  orthant 
dependence  (PLOD),  is  obtained  by  replacing  all  ">"  by  in 
(1.1).  In  the  bivariate  case,  considered  by  Lehmann  (1966), 

PUOD  and  PLOD  are  equivalent;  a  discussion  of  the  more  general 
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case  is  given  by  Tong  (1980).  A  different  positive  dependence 

concept  which  implies  both  PUOD  and  PLOD  is  association.  The 

entries  of  X  are  said  to  be  associated  if  Cov( f (X)g(X) )  > 0  for 

all  binary  valued  functions  f  and  g  which  are  nondecreasing  in 

each  argument  (see  Barlow  and  Proschan  (1975)).  A  simple  checkable 

condition  which  implies  association  is  called  TP2  in  pairs. 

See  Block  and  Ting  (1981)  for  a  review  of  this  and  other  positive 

dependence  concepts,  their  relations,  and  references. 

A  concept  of  negative  dependence,  negative  upper  orthant 

dependent  (NUOD),  is  obtained  if  ">"  is  replaced  by  in 

(1.1).  For  example,  the  multinomial  distribution  is  NUOD. 

Various  other  distributions  which  have  the  same  structure  as  a 

P 

multinomial  (i.e.,  essentially  J  Xj  being  constant)  also  are  NUOD 

i=l 

(see  Block,  Savits  and  Shaked  (1982)).  Other  concepts  of  nega¬ 
tive  dependence  are  discussed  by  Karlin  and  Rinott  (1980)  and 
Ebrahimi  and  Ghosh  (1981). 

Concepts  derived  from  (1.1)  can  be  used  to  partially  order, 
according  to  degree  of  positive  dependence,  random  vectors  whose 
one-dimensional  marginal  distributions  agree.  The  random  vector 
X  is  more  PUOD  than  Y  if  Pr[X,  >  t, , . . . ,X  >  t  ]  >  Pr[Y,  >  t, , . . . ,Y  >  t  ] 

—  —  1  1  p  p  —  1  1  pp 

for  all  t, . t  .  If  ">”  is  replaced  by  "<"  in  the  above,  then 

—  P 

X  is  said  to  be  more  PLOD  than  Y.  The  bivariate  version  of  the 
more  PLOD  ordering  was  first  described  by  Tchen  (1980). 

When  X~N(0,E^),  then  (a)  the  density  is  TPg  in  pairs  if 
and  only  if  A  *  satisfies  the  condition  for  all  i^j 


(Barlow  and  Proschan  (1975));  (b)  X  is  associated  if  and  only 

if  is  a  nonnegative  matrix  (Pitt  (1982));  (c)  X  is  PUOD  and 
PLOD  if  and  only  if  £x  is  a  nonnegative  matrix  (this  follows  from 
(b));  and  (d)  X  is  NtJOD  if  and  only  if  the  off-diagonal  elements 
of  Z^  are  nonpositive  (Block  et  al  (1981)).  If  Y~N(0,XY),  where 
the  diagonal  elements  of  and  Ly  are  the  same  and  £x  -  Z^  is 
a  nonnegative  matrix,  then  X  is  more  PUOD  (Das  Gupta  et  al  (1972, 
Remark  5.1))  and  also  more  PLOD  (Slepian  (1962))  than  Y.  The 
density  of  |X1|,...,|X  |  is  TP2  in  pairs  if  and  only  if  there 
exists  a  diagonal  matrix  D  with  elements  +  1  ,  such  that  the  off- 
diagonal  elements  of  DE^D  are  all  nonpositive  (Karlin  and  Rinott 
(1981),  and  Abdel-Hameed  and  Sampson  (1978)).  For  certain  struc¬ 
tural  conditions  on  |X1| . |Xpj  are  associated  (Ahmed,  Leon 

and  Proschan  (1981))  and  also  PUOD  (Khatri  (1967));  however, 

|XjJ . | Xp |  are  PLOD  (Sid£k  (1967))  for  all  Z^.  Under  certain 

conditions  on  Z x  and  Xy  can  156  shown  that  | X^ | , . . . , | Xp |  are 
more  PLOD  (Sidak  (1968))  and  also  more  PUOD  (Sid£k  (1975))  than 
|Y1|,...,|Yp|. 

Let  X  ~  N ( 0 , Xx )  ,  Y~N(0,Zy),  s2~xx>  u2  ~  and  assume  X,  Y, 

2  2 

s  and  u  are  all  independent.  If  £.x  is  a  nonnegative  matrix, 
then  the  scaled  t-vector,  X,/s,...,X  /s  is  both  PLOD  and  PUOD. 

This  result  follows  from  Theorem  3.2.1  of  Ahmed,  Langberg,  Ledn 
and  Proschan  (1979),  and  the  PLOD  and  PUOD  result  for  X.  If 
— X  ”  — Y  is  a  nonne&ative  matrix  and  Xx  and  Xy  have  the  same  dia¬ 
gonal  elements,  then  X^/s,...,X  /s  is  both  more  PLOD  and  more  PUOD 
than  Yj/u, . . . , Yp/u  (see  Das  Gupta  et  al  (1972,  Theorem  5.1  and 


Remark  5.1)).  Under  any  conditions  that  allow  |X1!,...,|X  | 
to  be  associated,  | X1 | /s , . . . , |X  | /s  will  be  associated  (see 
Abdel-Hameed  and  Sampson,  (1978,  Lemmas  4.1  and  4.2)).  Sidak 
(1971)  showed  that  | X^| /s X| /s  is  PLOD  for  arbitrary 
Z^.  The  analogous  PUOD  result  has  been  established  for 
certain  special  cases  of  by  among  others  Abdel-Hameed 

and  Sampson  (1978,  Theorem  4.2)  and  Ahmed,  Langberg,  Leon  and  Pros- 
chan  (1979,  Sec.  5.6).  If  E^-Zy  is  a  positive  semi-definite 
matrix  then  JX-^l/s,  .  .  .  ,  |  X^ }  /  s  is  more  PLOD  than  |  Y^.  | /u,  .  .  .  ,  |  Y  | /u 

(this  follows  from  Das  Gupta  et  al  (1972,  Theorem  3.3)).  Many 

2  2 

of  these  results  also  hold  when  s  and  u  are  arbitrary  positive 
random  variables ,  or  when  the  denominators  of  the  t-vectors  are 
not  all  the  same  random  variable.  These  scaled  multivariate 
t-distributions  and  their  generalizations  arise  naturally  in  re¬ 
gression  problems,  when  the  sample  regression  coefficients  are 
studied. 

Let(Xil ,  .  .  .  ,Xip) '  ,  i  =  l,...,n,  be  i.i.d.  according  to  N(0,£) , 

where  Z  is  any  covariance  matrix  such  that  |X^  |  , .  .  .  ,  |X^  |  are 

2  2 

associated.  Then  IX^,...,  ZX.p,  which  can  be  considered  (up 

2 

to  scaling)  to  be  a  p-dimensional  multivariate  x  >  are  associated. 
(The  proof  is  a  direct  p-variate  extension  of  Theorem  4.1  of 
Abdel-Hameed  and  Sampson  (1978)).  Similar  results  hold  for  multi¬ 
variate  F-distributions. 

For  distributions  with  an  elliptically  symmetric  density, 

Das  Gupta  et  al  (1972)  give  a  number  of  results  concerning  the 
random  variables  being  more  PLOD  and  more  PUOD.  Sampson  (1982) 
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gives  necessary  and  sufficient  conditions  in  the  bivariate  ellip- 
tically  symmetric  case  for  the  density  to  be  TP2 •  For  a  random 
vector  X  having  a  distribution  with  a  covariance  scale  parameter 
Sampson  (1980)  gives  sufficient  conditions  for  the  association 
of  X. 


3 .  Boole,  Bonferroni  and  Frgchgt 

For  the  special  case  when  the  random  vector  has  entries  which 

are  indicator  functions  of  sets,  there  are  specialized  probability 

inequalities.  Let  IA  (Xi)  =  l,  or  0,  respectively,  as  X.^  e  A.^  or 

Xi^A;j,.  The  simplest  such  inequality  is  the  well-known  Boole's 

P 

inequality  which  states  max  PrCI.  (X. ) = 1]  < Pr[u  (I.  (X.)=l)] 

P  l<i<p  i  1  i-1  i  1 

<  l  PrCI.  (X. )  = 1] .  There  are  a  string  of  more  generalized  in¬ 
is'!  Ai  1 

equalities  called  Bonferroni  inequalities  (see  Feller  (1968)),  the 

best  known  of  which  is  PrCu  (I.  (X.  )  ■  1) ]  >_  1  -  f  PrCI.  (X.)=0]. 

i=l  Ai  1  p  i=l  Ai  1 

This  can  be  extended  by  noting  that  Pr[n  (I.  (X .  )  =  1 )  ]  <_ min 

i=l  Ai  1  l<i<p 

PrCI.  (X.  )=1].  In  the  special  case  when  A.  =(-°°,x.  ], 

H  ^  1  XI 

i  =  l,...,p,  these  latter  two  inequalities  can  be  combined  to  give 
the  multivariate  Fr4ch6t  bounds  which  are  written  as 

max  (0,  f  F.  (x,  )-(p-l) )  <  Fy  . X  (x. . x  )<  min  F.(x.). 

i=l  1  1  “  X1  P  1  P  "  1< i<p  1  1 

This  inequality  provides  bounds  on  the  joint  cumulative  distri¬ 
bution  function  in  terms  of  the  one-dimensional  marginal  cumulative 
distribution . 


4 .  Convex  Symmetric  Set  Inequalities 

There  are  a  number  of  probability  inequalities  involving 
convex  symmetric  sets  which  may  be  viewed  in  some  sense  as  gener- 
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alizations  of  positive  dependence  and  stochastic  ordering.  Ander¬ 
son  (1955)  showed  that  if  X  has  a  density  g(x)  symmetric  around 
the  origin  and  satisfying{(x  :g(x)  ^  t )}  is  a  convex  set  for  all 
t,  0  <  t  <  »,  then  for  any  convex  set  C  symmetric  around  the  origin, 

Pr[X  +  s£  e  C]  is  a  monotone  decreasing  function  of  s,  0£s£l,  for 
every  constant  vector  £.  Khatri  (1967)  showed  for  multivariate  normal 
vector  X^X^iXg)  with  cov(X^,X2)  having  rank  one  that 
PrCX^  e  C^.Xg  e  1 e  ^3  Pr^— 2  e  ^2^  ^or  any  convex  symmetric 

sets  and  C^-  Under  Certain  conditions  on  the  cov(X1(X2), 

Khatri  (1976)  obtained  the  reverse  version  of  this  inequality 
with  replaced  by  ,  and  C1  and  C 2  being  complements  of 

I 

convex  symmetric  sets.  Pitt  (1977)  has  shown  that  if  X=(X1(X2)  , 
where  X~N(0,^)  and  C-^.Cg  are  convex  symmetric  sets,  then 
Pr[X  e  Cj  n  C2]  >Pr[X  e  PrCXeCg].  For  certain  types  of  convex 
sets  involving  quadratic  forms,  Dykstra  (1980)  has  obtained  in¬ 
equalities  for  the  multivariate  normal.  For  instance,  if  Ev  =1 

z 

and  Cov  (X^.Xg)  arbitrary,  then  Pr(X^  e  ^2.’~2-—2  —  c2^—  1  e 

» 

1>r^2— — 2  — c2^  *or  convex  symmetric  sets  and  real  numbers 

2 

c2  — 0>  w^ere  A  is  any  matrix  satisfying  A  =  A. 

5.  Stochastic  Ordering 

Stochastic  ordering  is  a  way  of  comparing  the  relative  sizes 
of  random  variables  (vectors).  For  example,  if  X  and  Y  are  uni¬ 
variate  random  variables  on  the  same  probability  space  one  possible 
definition  of  the  concept  of  X  being  less  than  or  equal  to  Y  is 
to  require  Pr[X  < Y3  *1.  Because  there  is  a  problem  when  X  and  Y 
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are  not  defined  on  the  same  space,  the  usual  definition  of  sto¬ 
chastic  ordering  is  P{X  >  t}  <^P{Y  >  t)  for  all  t.  This  is  written 
st 

X  Y,  i.e.,  X  is  stochastically  less  than  Y.  Moreover,  if 
st 

X  Y,  it  can  be  shown  that  there  exist  random  variables  X  and 

Y  defined  on  the  same  probability  space  with  the  same  marginal 

distributions  as  X  and  Y,  respectively,  such  that  Pr[X^Y]  =  l. 

st 

It  also  can  be  shown  that  X  <_  Y  if  and  only  if  E(<j>(X) )  £  E(  <|>(Y) ) 
for  all  nondecreasing  functions  <J>  (see  Marshall  and  Olkin  (1979, 
p.  193)). 

In  the  multivariate  case,  X  is  stochastically  less  than  Y, 
st 

denoted  by  X  <  Y,  if  E( <(>(X) )  _<  E( <)>(  Y) )  for  all  nondecreasing  func¬ 
tions  <J> ,  where  <f> :  1RP-^3R1.  This  condition  implies,  but  is  not 
equivalent  to,  Y  being  more  PUOD  than  X  (Marshall  and  Olkin 
(1979,  p.  486)).  (See  the  next  xection  for  an  example  of  multi¬ 
variate  stochastic  ordering.)  An  existence  theorem  similar  to 

st 

the  one  in  the  one  dimensional  case  holds,  that  is,  X<^Y  implies 
the  existence  of  component-wise  ordered  random  vectors  on  the 
same  space  with  the  same  marginal  distributions  as  X  and  Y, 
respectively.  See  Arjas  and  Lehtonen  (1978),  and  Marshall  and 
Olkin  (1979)  for  the  proofs  and  discussions  of  these  results 
in  both  the  univariate  and  multivariate  cases. 

6.  Stochastic  Majorization  and  Inequalities 
Obtained  by  Majorization 

One  simple  way  to  define  stochastic  majorization  between 
random  vectors  X  and  Y  is  to  require  Pr[X^  Y]  =  1  where  {  denotes 
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ordinary  majorization .  This  definition  involves  the  joint 
distribution  of  X  and  Y,  and,  hence,  other  definitions  are  pre¬ 
ferable.  An  alternate  definition  is  to  say  that  X  is  stochas¬ 
tically  majorized  by  Y  if  E(4>(X) )  £  E(<j>(Y) )  for  all  Schur  convex 
functions  <J>.  See  Marshall  and  Olkin  (1979,  pp.  281-5  and  311-7) 
for  other  possible  definitions  of  stochastic  majorization  and 
their  interrelationships.  With  the  use  of  these  concepts,  vari¬ 
ous  functions  of  random  vectors  corresponding  to  standard  fami¬ 
lies  can  be  shown  to  be  Schur  convex  and  useful  inequalities 
can  be  obtained  (see  Marshall  and  Olkin  (1979,  Chapter  11)). 
Majorization  techniques  can  also  be  used  to  show  that 
E(<J>(X) )  <  E(i(i(Y) )  for  other  families  of  functions  (see  Marshall 
and  Olkin  (1979,  Chapter  12)).  For  example,  let  Y, ,...,Y  , 

f  t 

Yl’.'.'Yp  be  2P  independent  exponential  random  variables  with 
means  X*1 . X”1 ,  ( X^ )-1 , . . . , (X^)-1 ,  respectively.  Proschan 

and  Sethuraman  (1976)  show  that  if  X^  ,  then  E($( Y^  ,  .  .  .  , Y^) )  <_ 

,  ,  st 

E(<KY1,  .  .  .  ,Yp) )  for  all  nondecreasing  <j> ,  i.e.,  (Y1(...,Yp)  <_ 

(Y^,...,Yp).  Thus,  if  Y*  « ( Y^ , . . . , Y^)  comes  from  a  heterogeneous 

random  sample  with  means  (  X  !!  )-1 ,  i  «  1 . p,  and  Y  =  ( Y^ ,  .  .  .  ,  Y  ) 

comes  from  a  homogeneous  random  sample  with  common  mean  X  * , 
p  st 

where  X  =  (  £  X )/p,  then  Y  <_  y'  ,  since  necessarily 
i=l  1 

( X , . . . , X ) ^ ( X^ , . . . , Xp) .  This  implies  that  all  of  the  order 
statistics  of  the  homogeneous  sample  are  stochastically  smaller 
than  the  corresponding  order  statistics  of  the  heterogeneous 
sample. 


A 
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7.  Chebyshev  and  Kolmogorov-Type  Inequalities 


A  standard  univariate  version  of  the  Chebyshev  inequality 

_  2 

is  Pr[|X-y|_<  ao]  >  1  -a  where  X  has  mean  y  and  finite  var- 
2 

iance  a  .  If  X, . X  are  independent  with  means  y.  and  finite 

2  P  p  1  P  _2 

variance  a  .  ,  i  =  1 ,  . . . , p,  then  Pr[n  {  |  X.  -  y.|<a.o.}]>n  ( 1  -  a .  ) . 

1  i=l  1  1  i=l  1 

If  the  X^  are  dependent,  various  authors  have  obtained  more 

general  inequalities  of  which  the  previous  inequality  is  a  spe¬ 
cial  case.  One  of  the  first  of  these  was  obtained  by  Berge  (1937) 

in  the  bivariate  case.  Let  X^  and  Xg  have  means  y^  and 
2  2 

variances  and  a 2  and  correlation  p.  Then  Pr[|  X^-y ^  |  <  ao^, 

i  2  4  2 

| Xg-y2 l^a  Og]  >  1  -  [1  +  (1  -  p  )!]a  .  Various  multivariate  inequal¬ 
ities  including  the  previous  one  can  be  obtained  from  the  follow¬ 
ing  general  result.  Let.  X=(X, . X  )  have  mean  vector  y  and 

r 

covariance  matrix  _£  =  (p..a.o.).  For  a.  >  0  define  the  matrix 

^  P  P  o 

T  = ( t .  . ) ,  where  t .  .  = P .  ,/a.a  .  Then  Prtn  ( |X  -u. |<a.o. }]>1-  \  a.  . 

-LJ  J-J  A  J  i=l  1  1  -1  i=l  x 

For  other  examples,  see  Tong  (1980,  pp.  153-154)  or  Karlin 

and  Studden  (1966,  pp.  517-519).  Both  of  these  references 

also  give  bounds  for  one-sided  probabilities;  for  example,  lower 

P 

bounds  can  be  obtained  on  probabilities  of  the  form  Pr[n  (X.<y.+ao.)], 

i=l  1  1 

where  X  = (X, . X  )  has  mean  vector  y  and  variances  a? ,  i  =  1 , . . . ,p, 

and  for  certain  a  >  0.  For  background  and  historical  references 
pertaining  to  the  Chebyshev  inequality,  see  Karlin  and  Studden 
(1966,  pp.  467-468).  One  of  the  earliest  books  to  contain  the 
material  on  multivariate  Chebyshev  inequalities  was  Godwin  (1964). 

A  Kolmogorov-type  inequality  is  similar  to  the  above,  except 
that  the  maximum  of  partial  sums  is  employed.  For  example, 


if  . XR  are  independent  and  have  mean  0  and  Sn=X^+...+Xn 

with  a(S  )  *  /Var(  S  )  ,  then  Pr[max  |  S  .  j  ( a ( S  )  )~*<a]  >  1  -  a~^. 
n  l<j<n  J  n  -  - 

A  multivariate  version  of  the  Kolmogorov  inequality  has  been 
obtained  by  Sen  (1971).  For  multivariate  applications  of  the 
univariate  (independent)  result  and  one-sided  analogs,  see  Tong 
(1980,  Section  7.3). 

8.  Multivariate  Moment  Inequalities 

For  the  moments  and  expectations  of  other  functions  of 
multivariate  distributions,  there  are  a  number  of  inequalities. 
The  most  well-known  states  that  is  the  population  covariance 
matrix  of  any  random  vector  if  and  only  if  I  is  nonnegative  de¬ 
finite.  Moreover,  if  is  positive  definite  and  the  sample 
size  large  enough,  the  corresponding  sample  covariance  matrix 
is  positive  definite  with  probability  one  (see  Eaton  and  Perlman 
(1973)).  For  suitable  bivariate  distributions,  there  exists 
a  canonical  expansion  (see  Lancaster  (1969))  and  a  sequence 
(p^  of  canonical  correlations.  This  sequence  {p^}  can  be  shown 
to  satisfy  certain  inequalities,  e.g.,  Griffiths  (1970)  or 
Thomas  and  Tyan  (1975). 

Chebyshev  has  given  the  following  covariance  inequality 
for  similarly  ordered  univariate  functions  ° f  a  random 

vector  X  (see  Hardy,  Littlewood  and  Polya,  (1952,  Sec.  2.17) 
or  Tong,  (1980,  Lemma  2.2.1)).  If  <f>  ^ ,  d>2  satisfy  the  condition 
that  (^(x)  -  <^(2.)  )(<P2(*)  ~  1  0  foT  all  suitable  x,£  then 

Cov(4»^(X),  4>2(X))2.0*  A  number  of  moment  inequalities  can  be 


A 


obtained  from  the  result  that  for  any  nonnegative  random  vector 


X,  whose  distribution  is  invariant  under  permutations,  it  follows 
Xi 

that  E  IT  Xi  is  a  Schur-convex  function  of  (X^,...,A  )  (see  Tong 
( 1980,  Lemma  6.2.4)).  For  example,  if  X  ~  N(u  e  ,  a2( 1-p )^  +  p  e e ' ) , 

,  ZXi  P  X  p  j 

where  e = (1, . . . ,1)  ,  then  EX.  x  >  E  IT  X.  >  E(II  X.)  where 

,  J  i=l  1  “  i=l  1 

Xi  >.  0  >  i  =  l,...,p  and  X  =  p~J'Z  X^ . 

There  are  several  results  relating  the  more  PLOD  ordering 
to  certain  moment  inequalities.  If  (X^.Xg)  is  more  PLOD  than 
(Yi,Y2),  then  any  of  the  following  measures  of  association: 
Pearson's  correlation,  Kendall's  x,  Spearman's  p  or  Blomquist's 
q  computed  based  on  (X^.Xg)  are  greater  than  or  equal  to  the 
corresponding  measure  based  on  (Y^.Yg)  (see  Tchen  (1980)). 

Dykstra  and  Hewett  (1978)  have  examined  the  positive  de¬ 
pendence  properties  of  the  characteristic  roots  of  the  sample 
covariance  matrix.  If  S  is  the  sample  covariance  matrix  based 
upon  a  random  sample  from  N(u,I. )  >  they  show  that  the  ordered 
characteristic  roots  are  associated  random  variables. 


9.  Applications 

Multivariate  probability  inequalities  are  very  important 

for  simultaneous  confidence  bounds,  where  lower  bounds  are  sought 

P 

on  probabilities  of  events  such  as  n  { | © .  -  0 . |  <  c .  } ,  where  the 

i=l  1  1  ~  1 

estimators  0^ . 0p  have  some  multivariate  distribution  pos¬ 

sibly  depending  on  nuisance  parameters.  The  basic  concept  is 
to  bound  this  probability  by  probabilities  of  marginal  events, 
where  no  parameters  are  involved.  For  instance,  if  X~N(y,jE), 


the  fact  that  lxi  "  I  .  •  •  • »  lxp  -  yp  |  are  PLOD  provides  conserva¬ 
tive  simultaneous  confidence  intervals  for  when  Xi , 

i  =  l,...,p  are  known.  General  discussions  of  applications  of 
probability  inequalities  to  simultaneous  inference  can  be  found 
in  Miller  (1981),  Krishnaiah  (1979),  and  Tong  (1980).  Also 
found  in  the  latter  are  applications  of  these  techniques  for 
establishing  unbiasedness  for  certain  multivariate  tests  of 
hypothesis . 

Many  of  the  dependence  inequalities  are  useful  in  applica¬ 
tions  ot  reliability  theory.  Consider  a  nonrepairable  binary 

system  consisting  of  p  binary  components  with  lifetimes  T, . T 

and  having  system  lifetime  T.  The  system  lifetime  T  is  generally 

a  function  of  the  component  lifetimes  such  as  T  =  max  min  Tt 

l<r<k  iePr 

where  the  Pr  are  min  path  sets  (see  Barlow  and  ProscKan  (1975, 
Chapters  1  and  2,  and  p.  150)).  In  general  the  T^  are  not  inde¬ 
pendent  and  it  is  desired  to  approximate  PrCTeB]  by  the  PrCT^B^], 
i  =  l,...,p,  where  B  and  the  B^s  are  usually  intervals.  To  do 
this,  various  univariate  and  multivariate  inequalities  are  used. 

As  a  simple  example,  if  the  T.  are  PUOD,  then  Pr[T>t]  = 
k  1 

PrCu  n  (T,  >  t )  ]  >  max  PrCn  (T.  >t)]>^  max  H  PrCT .  >  t ) , 
r=l  ieP^,  l<r<k  ieP„  l<r<k  ieP_ 

where  the  first  inequality  follows  from  Boole's  inequality  and 
the  second  follows  from  PUOD.  If  the  distributions  of  the  Ti 
are  not  known  but  the  T^  lie  in  a  class  of  wearout  distributions, 
e.g.,  T.^  has  increasing  failure  rate,  lower  bounds  on  Pr[T>t] 
can  be  found  in  terms  of  the  bounds  on  PrC^  >t].  For  example, 
Theorem  6.7  of  Barlow  and  Proschan  (1975,  Chapter  4)  can  be  em- 
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ployed  in  such  an  application.  Many  other  applications  of  this 
type  are  contained  in  Barlow  and  Proschan  (1975,  Section  4.6). 
Generalizations  of  this  type  of  result  to  multistate  systems 
have  been  given  by  Block  and  Savits  (1982). 
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